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NT 

PCT 



RAW SEQUENCE LISTING DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A TIME: 11:27:28 

Input Set : N:\EBONyS\US09830691A.raw.txt 
Output Set: N:\CRF4\09082003\I830691A.raw 

1 <110> APPLICANT: Choi, Eui-Sung 

2 Rhee, Sang-Ki 

3 Sohn, Jung-Hoon 

4 Park, Soo-Dong 

5 Lee, Yoon-Hyoung 

6 Lee, Seung-Jae 

7 Jang, Jae-Kweon 

8 Choi, Seok-Keun 

9 Son, Young-Rok 

10 <120> TITLE OF INVENTION: VECTOR FOR THE TRANSFORMATION OF PHAFFIA 

11 RHODOZYMA AND PROCESS OF TRANSFORMATION THEREBY 

12 <130> FILE REFERENCE: 118.12-US-WO 

C — > 13 <140> CURRENT APPLICATION NUMBER: US/09/830 , 691A 

14 <141> CURRENT FILING DATE: 2001-04-26 

15 <150> PRIOR APPLICATION NUMBER: KR 1998/46547 

16 <151> PRIOR FILING DATE: 1998-10-31 

17 <150> PRIOR APPLICATION NUMBER: PCT/KR99/002 65 

18 <151> PRIOR FILING DATE: 1999-05-29 

19 <160> NUMBER OF SEQ ID NOS : 20 

20 <17 0> SOFTWARE: FastSEQ for Windows Version 4.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 1223 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Phaffia rhodozyma 

26 <400> SEQUENCE: 1 



27 atggtcaacg ttcccaagac tegaegtgag ttatagcaat- ttcaacaact ctccagacga 60 

28 caaatattcc agtgcatcga aagagtttgt ggataaaege gacagtttca agggaaagag 120 

29 tcgatggaca gatttggaag acttagcegg tcaaggaact tggggatcac gtggcggagg 180 

30 actcatcaga agaagteggg atttgtttga tcatagtggg atcaagacaa actggaggat 240 

31 atggctcgcc ttggaaggga atctccggcc tggattcgag gatccgaaag ttgtacgtat 300 

32 ggaaaagctt acaeggcttg gatttattat ctttcatagg aacctactgc aagggtaagg 360 

33 ettgeaagaa geacaegtaa gtegcttate ctctccactc tttcatggca tattgtcaac 420 

34 gactggacaa cgcgtccgtt ttgaaacaag tgacttacct gtgaaatttg attctacacc 480 

35 tgtatttagc cctcacaagg tacatatcac atcctcccac cccaccctgc ccaacttctt 540 

36 cagttcatct tgctctcggt ttccacattc cctgatgacc tccttgtatg ttctttgega 600 

37 acgtttgttt ctgtttctgt ' aggtgaccca gtacaagaag ggaaaggact ccatcttcgc 660 

38 ccagggaaag cgacgatacg accgaaagca gtccggttac ggaggtcaga ccaagcccgt 720 

39 tttccacaag aaggctaaga ccaccaagaa ggtegtcett egattggegg tatttttgtt 780 

40 tattttgaat tctttttgtg tatgeagact . tttgatgatt atgctcctct gtcgtttttt 840 

41 ctcttcaaac agagtgetec gtctgcagtt cgttcttcct tccaaccaaa acttcaacta 900 

42 cagacatcat aaacagacat ettacttegg tgttctctct ttttttcege ag.agtacaag 960 

43 atgcagatga ccctcaagcg atgeaagcac ttcgagcttg gaggagacaa gaagaccaag 1020 

44 * ggttcgtctt ttgtccatat attctctggt tcacttctta tgttcctaac gtacttgttt 1080 
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RAW SEQUENCE LISTING DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A TIME: 11:27:28 



Input Set : N: \EBONY ' S\US09830691A. raw . txt 
Output Set: N:\CRF4\09082003\l830691A.raw 
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ORGANISM: Phaffia rhodozyma 
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c; £ 
Oj 


<zzz> 


LOCATION: (30) . . 
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SEQUENCE: 2 
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ORGANISM: Phaffia rhodozyma 
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SEQUENCE: 3 






























87 




Met Val Asn 


Val 


Pro 


Lys 


Thr 


Arg 


Arg 


Thr 


Tyr 


Cys 


Lys 


Gly 


Lys 


Ala 




88 




1 




5 










10 










15 






89 




Cys Lys Lys 


His 


Thr 


Pro 


His 


Lys 


Val 


Thr 


Gin 


Tyr 


Lys 


Lys 


Gly 


Lys 




90 






20 










25 










30 








91 




Asp Ser lie 


Phe 


Ala 


Gin 


Gly 


Lys 


Arg 


Arg 


Tyr 


Asp 


Arg 


Lys 


Gin 


Ser 




92 




35 










40 










45 










93 




Gly Tyr Gly 


Gly Gin 


Thr 


Lys 


Pro 


Val 


Phe 


His 


Lys 


Lys 


Ala 


Lys 


Thr 




94 




50 








55 










60 












95 




Thr Lys Lys 


Val 


Val 


Leu 


Arg 


Leu 


Glu 


Cys 


Ser 


Val 


Cys 


Lys 


Tyr 


Lys 





file://C:\CRF4\Outhold\VsrI830691A.htm 



9/8/03 



Page 3 of 8 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/830 , 691A 
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TIME: 11:27:28 
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65 70 75 80 

Met Gin Met Thr Leu Lys Arg Cys Lys His Phe Glu Leu Gly Gly Asp 

85 90 95 

Lys Lys Thr Lys Gly Ala Ala lie Ser Phe 
100 105 

SEQ ID NO: 4 
LENGTH: 741 
TYPE: DNA 

ORGANISM: Phaffia rhodozyma 
FEATURE: 

NAME /KEY : misc_f eature 
LOCATION: (0) ... (0) 

OTHER INFORMATION: n=a, t, c, or g 
SEQUENCE: 4 

ctcgagtgga cggtggcaat ggcattcgtg tcgttggtgc tcactcgcaa 
cgcttacccg gggtagcctc cgggtgggcg cgatgatttg tggtgtggat 
tgggtagaac gacgcgcaac caatcattcg gagaaccgct ccgttgtagc 
gattgatcaa catgccagca cgtcctccgg gacggagact ggcggggatc 
tggaatcgct ggctcaatgg tagtagtctt cacgatcggc catgagggca 
gttcgcctgc cgaagactgt gtgagtgtgc tganaactaa ttgagtaccg 
caaggcgtgt ntggttgccg gtggctgtga gcgagtttgc tgcaaagcga 
cccggcttgg ccagcgcgct gcgtcacgaa acacactaaa cggttgacgc 
taacacactc aagtttgtgg tcccgggtgg gcctctgtgc ctgcgtggga 
gaggaaaacg ttctgtggcc ctctcctctg tggatagtta cctggttgat 
gtcatatgct tgtctcaaag attaagccat gcatgtctaa gtataaacaa 
tgaaactgcg aatggctcat taaatcagtt atagtttatt tgatggtacc 
gataactgtg gtaattctag a 
SEQ ID NO: 5 
LENGTH: 23 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: CYH1, a PCR primer for the cloning of L41 genomic 
DNA fragment 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (0) ... (0) 

OTHER INFORMATION: n=a, t, c, or g 
SEQUENCE: 5 

cgcgtagtta aygtnccnaa rac 23 

SEQ ID NO: 6 
LENGTH: 25 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: CYH3, a PCR primer for the cloning of L41 genomic 
DNA fragment 
SEQUENCE: 6 

cccgggtytt ggcyttyttr 'tgraa 25 



cccaagcagt 60 
tccttcccta 120 
cgaccagtct 180 
gtacctcatc 240 
gtctaggtgg 300 
ggggataagg 360 
ttcaatgcac 420 
cataaagtaa 480 
cccgacggga 54 0 
cctgccagta 600 
attcatactg 660 
ttgctacatg 720 
741 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/830 , 691A 



DATE: 09/08/2003 
TIME: 11:27:28 



Input Set : N: \EBONY 1 S\US09830691A. raw . txt 
Output Set: N:\CRF4\09082003\l830691A.raw 



149 <210> SEQ ID NO: 7 

150 <211> LENGTH: 24 

151 <212> TYPE: DNA 

152 <213> ORGANISM: Artificial Sequence 

153 <220> FEATURE: 

154 <223> OTHER INFORMATION: 3 f RACE primer 

155 <400> SEQUENCE: 7 

156 ggtcagacca agcaagtttt tcac 24 

158 <210> SEQ ID NO: 8 

159 <211> LENGTH: 24 

160 <212> TYPE: DNA 

161 <213> ORGANISM: Artificial Sequence 

162 <220> FEATURE: 

163 <223> OTHER INFORMATION: 5' RACE primer 

164 <400> SEQUENCE: 8 

165 gtgaaaaact tgcttggtct gacc 24 

167 <210> SEQ ID NO: 9 

168 <211> LENGTH: 24 

169 <212> TYPE: DNA 

170 <213> ORGANISM: Artificial Sequence 

171 <220> FEATURE: 

172 <223> OTHER INFORMATION: sense primer for the mutagenesis of L41 gene 

173 <400> SEQUENCE: 9 

174 ggtcagacca agcaagtttt tcac 24 

176 <210> SEQ ID NO: 10 

177 <211> LENGTH: 24 

178 <212> TYPE: DNA 

179 <213> ORGANISM: Artificial Sequence 

180 <220> FEATURE: 

181 <223> OTHER INFORMATION: antisense primer for the mutagenesis of- L41 gene 

182 <400> SEQUENCE: 10 

183 gtgaaaaact tgcttggtct gacc 24 

185 <210> SEQ ID NO: 11 

186 <211> LENGTH : 20 

187 <212> TYPE: DNA 

188 <213> ORGANISM: Artificial Sequence 

189 <220> FEATURE: 

190 <223> OTHER INFORMATION: a PCR primer corresponding to 18S rDNA 

191 <400> SEQUENCE: 11 

192 tcctagtaag cgcaagtcat 20 

194 <210> SEQ ID NO: 12 

195 <211> LENGTH: 20 

196 <212> TYPE: DNA 

197 <213> ORGANISM: Artificial Sequence 

198 <220> FEATURE: 

199 <223> OTHER INFORMATION: a PCR primer corresponding to 18S rDNA 

200 <400> SEQUENCE: 12 

201 ttcggccaag gaaagaaact 20 
203 <210> SEQ ID NO: 13 
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RAW SEQUENCE LISTING DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A ■ TIME: 11:27:28 

Input Set : N: \EBONY ' S\US0 9830 691A. raw. txt 
Output Set: N:\CRF4\09082003\I830691A.raw 

204 <211> LENGTH: 20 

205 <212> TYPE: DNA 

206 <213> ORGANISM: Artificial Sequence 

207 <220> FEATURE: 

208 <223> OTHER INFORMATION: a PCR primer corresponding to 28S rDNA 

209 <400> SEQUENCE: 13 

210 aatcggatta tccggagcta 20 

212 <210> SEQ ID NO: 14 

213 <211> LENGTH: 20 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial Sequence 

216 <220> FEATURE: 

217 <223> OTHER INFORMATION: a PCR primer corresponding to 28S rDNA 

218 <400> SEQUENCE: 14 

219 gctataacac atccggagat 20 

221 <210> SEQ ID NO: 15 

222 <211> LENGTH: 2192 

223 <212> TYPE: DNA 

224 <213> ORGANISM: Phaffia rhodozyma 

225 <400> SEQUENCE: 15 



226 aagagctatt tgaatgacga ccacaagagt gacgatcata ttgagcatag tataccaaag 60 

227 gccaagaggc tgtgtggtgt tctatgagtg gccttgatta tgtgttacat aaataaactg 120 

228 atctcaattt ttcaaatact tgccaacact ttcatatatt cacaccaaaa aaagtcagat 180 

229 tggcccacaa agtcagatac acgctcgatc gtcgacgggt tcaagcactt tgtcaggcga 240 

230 aagaaaggcc acagcaccac ccttcaagtc tcgtctcaat caggttcgtc tagctttttg 300 

231 tgtgcaagga tttaccgtct tgatggattt gttcgttgaa agagaggaaa gaaca'tgctg 360 

232 aactgacgaa agtgtgaaca aaaaattgtg attttttcat tgtgtttcgc tggtctcctt 420 

233 gctgggttgg gttggatcgg atttatcttc tgtgttggat ggaaaaccct gaatgttctt 480 

234 ttcttggaca tcttctaaac tcgacaaaac gattcattcc tccgtactgc tctggttctg 540 

235 cctttttgaa tcgcatcgat aaattcttcc ctcggaacgt tcgatcaatc tccgtcaaac 600 

236 ttatcatcca aaaatctctt ctcgactgcc gccttgctcc ttttcttcgt tctttcctta 660 

237 atccgctttc gactaccctc cttctcttca cactcatagt caagatggtc aacgttccca 720 

238 agactcgacg tgagttatag caatttcaac aactctccag acgacaaata ttccagtgca 780 

239 tcgaaagagt ttgtggataa acgcgacagt ttcaagggaa agagtcgatg gacagatttg 840 

240 gaagacttag ccggtcaagg aacttgggga tcacgtggcg gaggactcat cagaagaagt 900 

241 cgggatttgt ttgatcatag tgggatcaag acaaactgga ggatatggct cgccttggaa 960 

242 gggaatctcc ggcctggatt cgaggatccg aaagttgtac gtatggaaaa gcttacacgg 1020 

243 cttggattta ttatctttca taggaaccta ctgcaagggt aaggcttgca agaagcacac 1080 

244 gtaagtcgct tatcctctcc actctttcat ggcatattgt caacgactgg acaacgcgtc 1140 

245 cgttttgaaa caagtgactt acctgtgaaa tttgattcta cacctgtatt tagccctcac 1200 

246 aaggtacata tcacatcctc ccaccccacc ctgcccaact tcttcagttc atcttgctct 1260 

247 cggtttccac attccctgat gacctccttg tatgttcttt gcgaacgttt gtttctgttt 1320 

248 ctgtaggtga cccagtacaa gaagggaaag gactccatct tcgcccaggg aaagcgacga 1380 
24 9 tacgaccgaa agcagtccgg ttacggaggt cagaccaagc ccgtttttca caagaaggct 1440 

250 aagaccacca agaaggtcgt ccttcgattg ggtacgtttt tgtttatttt gaattctttt 1500 

251 tgtgtatgca gacttttgat gattatgctc ctctgtcgtt ttttctcttc aaacagagtg 1560 

252 ctccgtctgc agttcgttct tccttccaac caaaacttca actacagaca tcataaacag 1620 

253 acatcttact tcggtgttct ctcttttttt ccgcagagta caagatgcag atgaccctca 1680 

254 agcgatgcaa gcacttcgag cttggaggag acaagaagac caagggttcg tcttttgtcc 1740 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A TIME: 11:27:29 

Input Set : N: \EBONY 1 S\US09830691A. raw. txt 
Output Set: N:\CRF4\09082003\I830691A.raw 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:4; N Pos. 334,371 
Seq#:5; N Pos. 15,18 
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VERIFICATION SUMMARY DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A TIME: 11:27:29 

Input Set : N: \EBONY ' S\US09830691A. raw. txt 
Output Set: N:\CRF4\09082003\l830691A.raw 

L:13 M:270 C: Current Application Number differs, Wrong Format 
L:116 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:300 
M:341 Repeated in SeqNo=4 

L:137 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos . : 0 
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STATISTICS SUMMARY DATE: 09/08/2003 

PATENT APPLICATION: US/09/830 , 691A TIME: 11:27:29 

Input Set : N: \EBONY 1 S\US09830691A. raw. txt 
Output Set: N:\CRF4\09082003\l830691A.raw 

Application Serial Number: US/09/830, 691A 

Alpha or Numeric or Xml : Numeric 

Application Class: 

Application File Date: 04-26-2001 

Art Unit: PCT 

Software Application: FastSEQ 
Total Number of Sequences: 20 
Total Nucleotides: 4766 
Total Amino Acids: 224 
Number of Errors: 0 
Number of Warnings : 3 
Number of Corrections: 1 

MESSAGE SUMMARY 

270 C: 1 (Current Application Number differs) 
341 W: 3 ((46) "n" or "Xaa" used) 
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